
Bhaskar Ramasubramanian

JobBench: Aligning Agent Work With Human Will

May 25, 2026
Via arXiv

Polyhedral Instability Governs Regret in Online Learning

May 13, 2026
Via arXiv

The WidthWall: A Strict Expressivity Hierarchy for Hypergraph Neural Networks

May 13, 2026
Via arXiv

Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?

May 12, 2026
Via arXiv

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

May 29, 2025
Via arXiv

SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge

May 27, 2025
Via arXiv

Temporal Sampling for Forgotten Reasoning in LLMs

May 26, 2025
Via arXiv

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

May 20, 2025
Via arXiv

Small Models Struggle to Learn from Strong Reasoners

Feb 17, 2025
Via arXiv

A Method for Fast Autonomy Transfer in Reinforcement Learning

Jul 29, 2024
Via arXiv